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Abstract 

In this paper, we investigate the diameter in preferential attachment (PA-) models, thus quantifying 
the statement that these models are small worlds. The models studied here are such that edges are 
attached to older vertices proportional to the degree plus a constant, i.e., we consider affine PA-models. 
There is a substantial amount of literature proving that, quite generally, PA-graphs possess power-law 
degree sequences with a power-law exponent t > 2. 

We prove that the diameter of the PA-model is bounded above by a constant times logt, where t 
is the size of the graph. When the power-law exponent r exceeds 3, then we prove that logt is the 
right order, by proving a lower bound of this order, both for the diameter as well as for the typical 
distance. This shows that, for r > 3, distances are of the order logi. For r S (2,3), we improve the 
upper bound to a constant times log logt, and prove a lower bound of the same order for the diameter. 
Unfortunately, this proof does not extend to typical distances. These results do show that the diameter 
is of order log logt. 

These bounds partially prove predictions by physicists that the typical distance in PA-graphs are 
similar to the ones in other scale-free random graphs, such as the configuration model and various 
inhomogeneous random graph models, where typical distances have been shown to be of order log logt 
when T <E (2, 3), and of order logt when r > 3. 

1 Introduction 

In the past decade, many examples have been found of real-world complex networks that are small worlds 
and scale free. The small-world phenomenon states that distances in networks are small. The scale-free 
phenomenon states that the degree sequences in these networks satisfy a power law. See [3l [Ml EH] for 
reviews on complex networks, and [5] for a more expository account. Thus, these complex networks 
are not at all like classical random graphs (see [U [9l [35] and the references therein) , particularly since 
the classical models do not have power-law degrees. As a result, these empirical findings have ignited 
enormous research on random graph models that do obey power-law degree sequences. See [11] for the 
most general inhomogeneous random graph models, as well as a review of the models under investigation. 
Extensive discussions of various scale- free random graph models are given in |2H [25] . 

While these models have power-law degree sequences, they do not explain why many complex networks 
are scale free. A possible explanation was given by Barabasi and Albert [6] by a phenomenon called 
preferential attachment (PA). Preferential attachment models the growth of the network in such a way 
that new vertices are more likely to add their edges to already present vertices having a high degree. For 
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example, in a social network, a newcomer is more likely to get to know a person who is socially active, 
and, therefore, already has a high number of acquaintances (high degree). Interestingly, PA-models with 
so-called affine PA rules have power-law degree sequences, and, therefore, preferential attachment offers 
a convincing explanation why many real-world networks possess this property. There is a large amount 
of literature studying such models. See e.g. O [101 (El [El [HI [HI [171 [22] and the references therein. 
The literature primarily focusses on three main questions. The first key question for PA-models is to 
prove that such random graphs are indeed scale free [H [TOl [T2l \T3\ \T7\ [22] , by proving that their degree 
sequence indeed obeys a power law with a certain power-law exponent r > 2. The second key question for 
PA-models is their vulnerability, for example to deliberate attack |13] or to the spread of a disease [?]• The 
third key question for PA-models is to show that the resulting models are small worlds by investigating 
the distances in them. See in particular [15] for a result on the diameter for a PA-model with power-law 
exponent r = 3. In non-rigorous work, it is often suggested that many of the scale-free models, such 
as the configuration model, the inhomogeneous random graph models in |llj and the PA-models, have 
similar properties for their distances. Distances in the configuration model have been shown to depend on 
the number of finite moments of the degree distribution. Similar results are true for the so-called rank-1 
inhomogeneous random graph (see e.g. |18 [ I19 [ [26l I40j ). The natural question is, therefore, whether the 
same applies to preferential attachment models. This is the main goal of the present paper, in which we 
investigate the diameter of scale-free PA-models. 

The remainder of this section is organized as follows. We first introduce the models that we will 
investigate in this paper. Then we give the main results and conclude with a discussion of universality in 
power- law random graphs. 

In this paper, we investigate the diameter in some PA-models. The models that we investigate 
produce a graph sequence or graph process {Gm,s{i)}, which, for fixed t > 1 or t > 2, yields a graph 
with t vertices and mt edges for some given integer m > 1. In the sequel, we shall denote the vertices of 
Gm,s{t) by l'™\ . . . ,t^"^K When m is clear from the context, we will leave out the superscript and write 
[t] = {1,2, . . . ,t}. We shall consider three slight variations of the PA-model, which we shall denote by 
models (a), (b) and (c), respectively. 

(a) The first model is an extension of the Barabasi- Albert model formulated rigorously in [17]. We start 
with Gi^5(l) consisting of a single vertex with a single self-loop. We denote the degree of vertex z^^' 
at time t by D-(i)(t), where a self-loop increases the degree by 2. 

Then, for m = 1, and conditionally on Gi^sit), the growth rule to obtain Gi^sit + 1) is as follows. 
We add a single vertex (t + 1)'^* having a single edge. This edge is connected to a second end point, 
which is equal to {t + 1)'^' with probability proportional to 1 + 5, and to a vertex i'^* G Gi^sii) with 
probability proportional to D-(i){t) + 6, where (5 > —1 is a parameter of the model. Thus, 



The model with integer m > 1, is defined in terms of the model for m = 1 as follows. We start with 
Gi,5'(mt), with 6' = 5/m > —1. Then we identify the vertices l'^\2'^' . . . , m*^^ in Gi^5(mt) to be 
vertex 1^""^ in Gm,5{t), and for I < j < t, the vertices {{j — l)m + 1)^^', . . . , {jmY^^ in Gi^5'(mt) to 
be vertex j*™' in Gm,5{t); in particular the degree D-(m){t) of vertex j*™^ in Gm^sit) is equal to the 
sum of the degrees of the vertices ((j — l)m + 1)*^', . . . , (jm)'^' in Gi^s'i^t)- This defines the model 
for integer m > 1. Observe that the range of 6 is [— m,oo). 

The resulting graph Gm,s{t) has precisely mt edges and t vertices at time t, but is not necessarily 
connected. For 5 = we obtain the original model studied in [17] . and further studied in |13 [ I14 [ [T5]. 
The extension to 5 7^ is crucial in our setting, as we shall explain in more detail below. 




for i = t + 1 



for i e [t]. 



(1.1) 
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(b) The second model is identical to the one above, apart from the fact that no self-loops are allowed 
for 171 = 1. We start again with the definition for m = 1. To prevent a self-loop in the first step, 
we let Gi^si^) undefined, and start from Gi^5(2), which is defined by the vertices 1'^' and 2*^' joined 
together by 2 edges. Then, for t > 2, we define, conditionally on Gi^sit), the growth rule to obtain 
Gi^s{t + 1) as follows. For (5 > -1, 

P((i + 1)'^' ^ ^^'^\G,4t)) = ^0^, for i G [t]. (1.2) 

The model with m > 1 is again defined in terms of the model for m = 1, in precisely the same way 
as in model (a). This model is studied in detail in [25], and the model with m = 1 corresponds to 
scale-free trees as studied in e.g. [37 1 l38l H2] . 

(c) In the third model, and conditionally on Gm,5it), tbe end points of each of the m edges of vertex 

are chosen independently, and are equal to a vertex z'""' S Gm,5{t), with probability proportionally 
to D^{m){t) + 5, where 5 > —m. We start again from Gm,5(2), with the vertices 1^™' and 2'™' joined 
together by 2m, m > 1, edges. Since the end point of the edges are chosen independently we can 
give the definition of {Gm,5it)}t>2, for m > 1, in one step. For 1 < j < m, 

edge of (t + l)'™' is connected to i'""' IG^ s{t)) = ^f"' , for i e W- (1-3) 

/ t[2m + 0) 

In this model, as is the case in model (b), the graph Gm,s{t) is a connected random graph with 
precisely t vertices and mt edges. This model was studied in [231 136j . 

Remark 1.1. In models (a) and (b) for m > 1, the choice of 5' = 5 /m is such that in the resulting graph 
Gm,5{f), where m vertices in Gi{mt) are grouped together to a single vertex in Gm,5{t), the end points of 
the added edges are chosen according to the degree plus the constant 5. 

Remark 1.2. For m = 1, the models (h) and (c) are the same. This fact will he used later on. 

The growth rules in (jl.ip - (jl.3p are indeed such that vertices with high degree are more likely to 
attract edges of new vertices. One would expect the models (a)-(c) to behave quite similarly, as is known 
rigorously for the scale-free behaviour, where the asymptotic degree distribution is known to be equal in 
models (a)-(c). As it turns out, the affine PA mechanism in (|l.ip - ()1.3p gives rise to power-law degree 
sequences. Indeed, in [23], it was proved that for model (c), the degree sequence is close to a power law 
with exponent r = 2> + 5/m. For model (a) and 5 = 0, this was proved in [17], while in [22], power-law 
degree sequences for PA-models with affine PA mechanisms are proved in rather large generality. We see 
that, by varying the parameters m > 1,5 > —m, we can obtain any power-law exponent r > 2, which is 
the reason for introducing the parameter 6 in (jl.ip - (jl.3p . However, there is no intrinsic reason for the 
affine PA mechanism. For results on PA-models in the non-affine case, see e.g., [4H 144 j. In general, such 
models do not produce power laws. 

The goal in this paper is to study the diameter in the above models, as a first step towards the study 
of distances in PA-models and the verification of the prediction that distances behave similarly in various 
scale-free random models (see also Section [1.21 below). In the following section, we describe our precise 
results. 

1.1 Bounds on the diameter in preferential attachment models 

In this section, we present the diameter results for the PA-models (a)-(c). The diameter of a graph G is 
defined as 

diam(G) = max{distG(i, j)|distG(i,i) < oo}, (1.4) 
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where distG'(i,j) denotes the graph distance between vertices i,j £ G. We prove that, for all 6 > —m, 
the diameter of Gm,5{t) is bounded by a constant times logt. When 6 = 0, we adapt the argument in 
[15] to prove that the diameter is bounded from below by (1 — e) loj^iogf ■ S > 0, this lower bound is 
improved to a constant times logt, while, for 6 < 0, we prove that the diameter is bounded above and 
below by a constant times log log This establishes a phase transition for the diameter of PA-models 
when 6 changes sign. We now state the precise results, which shall all hold for each of the models (a)-(c) 
simultaneously. In the results below, for a sequence of events {At}t>i, we write that At occurs with high 
probability (whp) when limt^oo IP(^t) = 1- 

Theorem 1.3 (A logt upper bound on the diameter). Fix m > 1 and 5 > —m. Then, there exists a 
constant ci = ci{m,5) > such that whp, the diameter of Gm.^s{t) is at most cilogt. 



When m = 1, so that the graphs are in fact trees, there is a sharper result proved by Pittel 
which, in particular, implies Theorem 11.31 for model (b). In this case, Pittel shows that the height of the 
tree, which is equal to the maximal graph distance between vertex 1 and any of the other vertices, grows 
like ^(2~^s) ^o§^(-'- + where 7 solves the equation 

7 + (l + 5)(l + log7) = 0. (1.5) 

This proves that the diameter is at least as large, and suggests that the diameter has size logt(l + 

0(1)). Scale-free trees have received substantial attention in the literature, we refer to |16[ I42| and the 
references therein. It is not hard to see that a similar result as proved in [l2] also follows for models (a) 
and (c). This is proved when 5 = m [16], where it is shown that the diameter in model (a) has size 
7~^ log t, where 7 is the solution of (jl.Sp when 6 = 0. Thus, we see that the log t upper bound in Theorem 
11.31 is sharp, at least for m = 1. 

It is not hard to extend the upper bound to m > 2. In particular, for model (b), the upper bound 
for m > 2 immediately follows from the upper bound for m = 1. For models (a) and (c), the extension 
is not as trivial, but the proof is fairly straightforward, and will be omitted here. To see an implication 
of [12] for model (a), we note that Ct, the number of connected components of Gi^s{t) in model (a), has 
distribution Gt = 1 + I2 + • • • + It, where /j is the indicator that the i*^ edge connects to itself, so that 
{Ii}i=2 independent indicator variables with 

= 1) = 7 FVT^^^^ (1-6) 

^ ^ {2 + 6){i-l) + l + d ^ ' 

As a result, Cj/logi converges in probability to (1 + (5)/(2 + (5) < 1, so that whp there exists a largest 
connected component of size at least t/\ogt. Conditionally on having size St, the law of any connected 
component in model (a) is equal in distribution to the law of the graph Gi^s{st + 1) in model (b), apart 
from the fact that the vertices 1 and 2 in Gi^si^t + 1) are identified (thus creating a double self-loop) and 
the vertices are relabeled by order of appearance. In particular, conditionally on having size st, the law 
of the diameter of the connected component in model (a) equals that of Gi^s{st + 1) in model (b). This 
close connection between the two models allows one to transfer results for model (b) to model (a) when 
m = 1. 

Theorem 1.4 (A logt lower bound on the diameter for 5 > 0). Fix m > 1 and 6 > 0. Then, there exists 
C2 = C2{m,5) > 0, such that whp, the diameter of Gm^sit) is at least C2logt. 

Theorems [LSHLD imply that, for 5 > and whp, diam{Gm,5it)) = G(logt). Theorems [OHLH 
indicate that distances in PA-models are similar to the ones in other scale-free models for r > 3. We shall 
discuss this analogy in more detail below. As we shall see in Section 12. 2| the proof of Theorem 11.41 also 
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reveals that, whp, the typical distance in Gm.^si't)^ which is the distance between two uniformly chosen 
connected vertices in the graph, is bounded from below by C2 logt. 

We conjecture that, for > 0, a limit result holds for the constant in front of the log t. In its statement, 
we write distc ('Vi , '(^2 ) for the graph distance in the graph G between two vertices ui,V2 G [t]- Then, the 
typical distance in a graph G is defined by distG(^i,V2) where Vi,V2 £ [t] are two uniformly chosen 
independent vertices. 

Conjecture 1.5 (Convergence in probability for 6 > 0). Fix m > 1 and 5 > 0. Then, the diameter 
diam{G m,5 (t)) / log t and the typical distance distcCVi, V2)/logt converge in probability to positive and 
different constants. 

We now turn to the case where 6 £ (— m, 0) and hence t = 3 + 5/m £ (2, 3): 

Theorem 1.6 (A log log t upper bound on the diameter for 6 < 0). Fix m > 2 and assume that 5 G 
(— m, 0). Then, for every cr > 1/(3 — r) and with 

Co = |, / + 7^, (1.7) 

I log (r — 2)1 logm 
the diameter of Gm,5{t) is, whp, bounded above by C^loglogi, as t ^ oo. 

In this result, we do not obtain a sharp result in terms of the constant. However, the proof suggests 
that for most pairs of vertices the distance should be equal to | (t— 2) logfog*(l + o(l)). When m = 1, 
Theorem 11.61 does not hold (see the discussion below Theorem 1 1.3p . 

We next discuss the lower bound on the diameter for 5 G {—m, 0): 

Theorem 1.7 (A log log t lower bound on the diameter). Fix m > 2 and 6 > —m. Then, the diameter 
of Gm,5{t) is, whp, bounded below by log log t, for all £ G (0,1). 

Unfortunately, the proof of Theorem 11.71 does not allow for an extension to typical distances, and, 
thus, we have no matching lower bound for this. We finally conjecture that, for 5 G (— m, 0), a limit 
results holds for the constant in front of the log log t: 

Conjecture 1.8 (Convergence in probability for 5 < 0). Fix m > 2 and 5 G (m, 0). Then, the diameter 
diam(Gm^5(i))/log logt and the typical distance distG(Vi, V2)/ log log t converge in probability to positive 
and different constants. 



1.2 Discussion of universality of distances in power-law random graphs 

Theorems I1.3H1.7I prove that the diameter in PA-models with a power-law degree sequence denoted by 
r undergoes a phase transition as r changes from r G (2, 3) to r > 3. The results identify the order 
of growth of the diameter of three related models of affine PA models as the size of the graph t tends 
to infinity. We do not obtain the right constants. For the typical distance, we obtain a similar phase 
transition, and again the results identify the correct asymptotics for r > 3, but, for r G (2, 3) we miss a 
matching lower bound. 

In non-rigorous work, it is often suggested that the distances are similarly behaved in the various 
scale-free random graph models, such as the configuration model or various models with conditional 
independence of edges as in [11]. For power-law random graphs, this informal statement can be made 
precise by conjecturing that distances have the same leading order growth in graphs with the same power- 
law degree exponent. This, however, is not correct for the diameter of such power-law random graphs, 
since the diameter depends sensitively on the details of the graph, such as the proportion of vertices with 
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degrees 1 and 2. See [29] and |34] for results showing that for the configuration model with power-law 
degree exponent r € (2,3), the diameter can be of order logt or of order log log t depending on the 
proportion of vertices with degrees 1 and 2, where t is the size of the graph. Similarly, in inhomogeneous 
random graphs with power-law degree exponent r S (2,3) the diameter is always of order logt (see e.g. 
|11]). while the typical distances can be of order loglogi (see e.g. \18\ I19j). Thus, we shall interpret the 
physicists' prediction by conjecturing that the leading order growth of the typical distances of various 
power-law random graphs depends only on the power-law degree exponent r G (2,3). 

The results on distances are most complete for the configuration model (CM), see e.g. \27\ [29| [32| 
[33l I43j . In the CM, there are various cases depending on the tails of the degree distribution. When the 
degrees have infinite mean, then typical distances are bounded [27], when the degrees have finite mean 
but infinite variance, typical distances grow proportionally to log log f [33] I43|. where t is the size of the 
graph, while, for finite variance degrees, the typical distances grow proportionally to logt |32j . Similar 
results for models with conditionally independent edges exist, see e.g. [HI [181 [261 00] , but particularly in 
the regime r G (2, 3), the results are not that strong. Thus, for these classes of models, distances are quite 
well understood. If the distances in PA-models are similar to the ones in e.g. the CM, then we should 
have that the distances are of order logt when r > 3, i.e., 6 > 0, while they should be of order log logt 
when T £ (2,3), i.e., for 5 < 0. In PA-models with a linear growth of the number of edges, infinite mean 
degrees cannot arise, which explains why r > 2 for PA-models. An attempt in the direction of creating 
PA-models with power-law exponent r G (1,2) can be found in [23], where a preferential attachment 
model is presented in which a random number of edges per new vertex is added. In this model, it is 
shown that the degrees again obey a power law with exponent equal to r = min{3 + j^,Tw}, where Ty^ is 
the power-law exponent for the number of edges added and fj, < oo the expected number of added edges 
per vertex. Thus, when r^, G (1)2), infinite mean degrees can arise. This model is further studied in [8], 
where a wealth of results for various PA-models can be found. 

There are few results on distances in PA-models. In [15], it was proved that in model (a) and for 
6 = 0, for which r = 3, the diameter of the graph of size t is equal to logi^g^ (1 + o(l)). Unfortunately, 
the matching result for the CM has not been proved, so that this does not allow us to verify whether the 
models have similar distances. The results stated above substantiate the physicists' prediction, since, for 
6 > for which r G (3, oo), the typical distances are of order logt, while, for 6 < 0, for which r G (2,3), 
they are bounded above by log logt. A related result on PA-models in the spirit of [22] can be found in 
[20] ■ where a similar phase transition as in this paper is proved, in the case where the number of edges 
grows at least (logt)^"'"'' times as fast as the number of vertices. 

It would be of interest to improve the bounds presented in this paper up to the constant in front of 
the logt and log logt, respectively. Due to the dynamical nature of PA-models, this is more involved for 
PA-models than it is for static models such as the CM and inhomogeneous random graphs. 

This paper is organized as follows. In Section [21 we prove the log t lower bound for the diameter stated 
in Theorem 11.41 In Section [31 and Section [H we prove the log log t upper bound and the log log t lower 
bound, on the diameter for 6 < 0, of Theorem 11.61 and Theorem 1 1.7[ respectively. 

2 A log lower bound on the diameter for ^ > 0: Proof of Theorem 11.41 

In this section, we prove Theorem 11.41 by extending the argument in [15] from 6 = to 6 > 0. We shall 
also extend the lower bound for (5 = to models (b) and (c). 
For model (c), denote by 

{9{t,j)=s}, l<j<m, (2.1) 

the event that at time t the j^^ edge of vertex t is attached to the earlier vertex s < t. For models (a) 
and (b), this event means that in {Gi^s'{mt)} the edge from vertex m(t — 1) + j is attached to one of the 
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vertices m{s — 1) + 1, . . . ,ms. It is a direct consequence of the definition of PA-models that the event 
(j2.ip increases the preference for vertex s, and hence decreases (in a relative way) the preference for the 
vertices u, 1 < u < t, u ^ s. It should be intuitively clear that another way of expressing this effect is to 
say that, for different si ^ S2, the events ji) = si} and {g{t2,j2) = S2} are negatively correlated. 

In order to state such a result, we introduce some notation. For integer > 1 and i = 1, . . . ,ns, we 
denote by 

Es = f]{g{ti,ji) = s}, (2.2) 

i=l 

the event that at time tj the jf^ edge of vertex ti is attached to the earlier vertex s, for all i = 1, . . . , ng. 
We will start by proving that for each k > 1 and all possible choices of tj, ji, the events Eg, for different 
s, are negatively correlated: 

Lemma 2.1 (Negative correlation of attachment events). For distinct si,S2, ■ ■ ■ , Sk, 

k k 

f]E,^)<l[F{EsJ. (2.3) 

i=l i=l 

Proof. We will use induction on the largest edge number present in the events Eg. Here, for an event 
{5(^1 j) = s}, we let the edge number be m{t — 1) + j, which is the order of the edge when we consider 
the edges as being attached in sequence. The induction hypothesis is that (|2.3p holds for all k and all 
choices of ti,ji such that maxj^sm(tj — 1) + < e, where induction is performed with respect to e. We 
initialize the induction for e = m in models (a) and (b) and for e = 2m in model (c). We note that for 
this choice of e, the induction hypothesis holds trivially, since everything is deterministic. This initializes 
the induction. 

To advance the induction, we assume that (j2.3p holds for all k and all choices of ti,ji such that 
maxj^s ni{ti — 1) + ji < e — 1. Clearly, for k and ti,ji such that maxj^^ m(ti — 1) + j'j < e — 1, the bound 
follows from the induction hypothesis, so we may restrict attention to the case that maxj^^ m(tj — 1)+ jj = e. 
We note that there is a unique choice of t,j such that m{t — 1) + j = e. In this case, there are again two 
possibilities. Either there is exactly one choice of s and ti,ji such that ti = t,ji = j, or there are at least 
two of such choices. In the latter case, we immediately have that HiLi = 0, since the e^^ edge can 
only be connected to a unique vertex. Hence, there is nothing to prove. Thus, we are left to investigate 
the case where there exists unique s and tj, jj such that ti = t,ji = j. Denote by 

ris 

E's= n {9{%k) = s], (2.4) 
the restriction of Eg to the other edges. Then we can write 

k k 

[]Es, = {g{t,j) = s]r^Kr^ f| E^,. (2.5) 

By construction, all the edge numbers of the events in E'^ n HiLi sj^s -^s, are at most e — 1. Thus, we 
obtain 

k k 

p(f|s,,) <E[/[i5;;n fl i?,jPe-i(<?(t,j) = ^)], (2.6) 

where Pg-i denotes the conditional probability given the edge attachments up to the (e — 1)'^* edge 
connection, and where, for an event A, I[A\ denotes the indicator of A. 
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We now first treat model (c), for which we have that 

D,{t-l)+6 



e-l 



i9{t,j)=s) 



(2m + (5)(t- 1) 

We wish to use the induction hypothesis. For this, we note that 

Ds{t-l)=m+ Yl n9{t',j') = 

{t',j'y.t'<t~i 



s . 



(2.7) 



We note that each of the terms in (|2.8p has edge number strictly smaller than e and occurs with a non- 
negative multiplicative constant. As a result, we may use the induction hypothesis for each of these terms. 
Thus, we obtain, using also m + 6 > 0, that. 



(2m + 6){t-l)F(f]Es, 



i=l 



<{m + S)F{Ei) J] P(i?,J 

k 

+ J2 nKn{9{t',j') = s}) n nEs,). 

{t',j'):t'<t-l i=l:si^s 



(2.9) 



We can recombine to obtain 



Ds{t-l)+5 



i=l 



'(2m + (5)(i 



u] n 



(210) 



and the advancement is completed when we note that 

D,{t-l) + 5 



E 



{2m + 5){t-l) 



'(Es). 



(2.11) 



The proofs for models (a) and (b) are somewhat simpler, since the events -Eg. can be reformulated in 
terms of the graph process {Gi^s'it)}t>i- D 
We next give the probabilities of Es when < 2; we omit the proof, since it is a simple adaptation 
to that in 1151. 



(2.12) 



Lemma 2.2 (Connections in PA-models). There exist absolute constants Mi,M2, such that (i) for each 
1 < j < m, and t > s, 

r(i>('..-) = »)s^. 

and (a) for t2 > ti > s, and any 1 < ji,j2 < m, 

¥(^g{ti,ji) = s,g{t2,j2) 

where a = 



s < 



Mo 



(tlt2)^"''S^''' 



(2.13) 



We combine the results of Lemmas 12.11 and 12.21 into the following corollary, yielding an upper bound for 
the probability of the existence of a path. In its statement, we call a path T = {sq, si, . . . , s/) self-avoiding 
when Si ^ Sj for all < i < j < /. We use the notation x Ay = min(x, y) and x\/ y = max(x, y). Again, 
we omit the proof (for details, see |15|). 
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Corollary 2.3 (Path probabilities in PA-models). Let T = (sg, si, • • • , s;) he a self- avoiding path of length 
I consisting of the I + 1 unordered vertices sq, s\, . . . , si, then there exists an absolute constant C > such 
that 



z-1 



^(r G G„,,{t)) < (m'Cy n ZT^a- (2-14) 

^ ^ f^{siASi+i)"-{siVSi+i) 



2.1 Lower bound on the diameter for 5 = 

It follows from ([231]) that for 5 = 0, 

z-1 



r G G^4t)) < (m^Cy n —^=. (2.15) 
The further proof that ()2.15|) implies that for (5 > 0, 

log(3Cm2logt)' ^ ' 

is a lower bound for the diameter of Gm,s{'t)^ is identical to the proof of |15^ Theorem 5, p. 14], with n 
replaced by t. This extends the lower bound for (5 = for model (a) in [15] to models (b)"(c). □ 

2.2 The lower bound on distances for 5 > 

We next improve the bound in the previous section in the case when 5 > 0, in which case a = m/{2m+6) < 
1/2. From the above discussion, we conclude that 

k-l ^ 

^ g J=0 i^j ^ («i V Sj+i) 

where c = mPC, and where the sum is over s = (sq, . . . , Sk) with Sk = t, sq = 1, si > 1 for alH = 1, . . . , k—1 
and si 7^ s„ for all I ^ n, since we may assume that our path (sq, • • • , Sfc) is self-avoiding. Define 

t) = E n r-. — — ^'-''^ 

where now the sum is over s = {sq, . . . , Sk) with Sk = t, sq = i, si > 1 for alH = 1, . . . , A; — 1 and si ^ s„ 
for alH / n, so that 

p(distG„ ,w(i,t) = A:) < cVfc(i,t). (2.19) 
We study the function fkii,t) in the following lemma: 

Lemma 2.4 (A bound on fk). Fix a < 1/2. Then, for every b > a such that a + b < 1, there exists a 
Ca,b > such that, for every 1 < i < t and all k > 1, 

/^(M)<^. (2.20) 
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Proof. We prove the lemma using induction on k > 1. To initialize the induction hypothesis, we note 
that, for 1 < i < t and every b > a, 

M^,t) = ' , = '= '-C-Y < = ' (2.21) 

(i At)'^(i Vt)^"'' i^t^-" t\i) - t\i) iH^-^ ^ ' 

This initializes the induction hypothesis as long as Ca^b ^ 1- 

To advance the induction hypothesis, note that we have the recursion relation 

i—l ^ oo ^ 

A(M)<E^/^-i(^'*)+ E (2-22) 

s=l s=i+l 

We now bound each of these two contributions, making use of the induction hypothesis. For the first 
sum, we bound 

i— 1 ^ i—l ^ ^ ^fc— 1 i—l ^ ^ ^fc— 1 

X/ ga^l-a-^'^-l^'^' - ^a,b ^ s^jl-a gb-j^l-b ~ jl-a-f-l-b X/ '^a+b - ]^ _ ^ _ 5 ibfl-b ' (2.23) 
s=l s=l s=l 

since a + 6 < 1. For the second sum, we bound 

OO ^ t—1 ^ ^ OO -j^ 

^agl-a-^'^-l^^'*) - ^a,b^ X] ^agl-a ^fc^l-b + ^a,^^ X] ^a^l-a thgl-b (2.24) 
s=i+l s=j+l s=t+l 



r^k — l t—1 ^ i^fc — 1 OO i^/c — 1 ^ — 1 

^ a,b 1 '-"a,b 1 ^ 1 '-^a,b 1 ^'a,6 

(-1—6 / J cl — a+6 »'a+6 / j e2— a- — ' 



s=«+l s=t+l 

since l + 5- a>l, 2-a-5>l,6>a and (t/i)" < {t/if. We conclude that 

f^^i^t)<^^J-^ + ')<T-4^, (2.25) 

when 

Ca,b = -r^ + ^ > 1. (2.26) 

b — a 1 — a — b 

This advances the induction hypothesis, and completes the proof. □ 
Using Lemma 12.41 and (|2.19p , we obtain that 

p(dist,^^,(,(l,t) = A:)<^^^. (2.27) 

As a result, 

p(diam(G™,5(t)) < A;) < p(distG^^(,)(l, t) < A;) < ^i-f^^^^^^\^ = o{l), (2.28) 

whenever k < loglcC t) ^^S*- conclude that there exists C2 = C2{m,6) such that, with high probability 
diam(^Gm,5(i)) > C2 logt. □ 

We next extend the above discussion to typical distances. 
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Lemma 2.5 (Typical distances for 6 > 0). Fix m > 1 and 6 > 0. Let Ht = distj(Ai, A2) he the distance 
between two uniformly chosen vertices. Then, for C2 = C2(m, (5) > sufficiently small, whp, Ht > C2 logt. 

Proof. For C2 = C2{m,5) > 0, define 

Bt = #{i,j e[t]:i<j: dist^^ ^(,)(i, j) < C2logt}, (2.29) 

where #{A} denotes the cardinahty of the set A. 

By Lemma [231 with K = log {cCa^b V 2) and a < b < 1 — a, and for all 1 < i < j < t, 



P dist 



(i,j) = A;) <cVfe(^,j) < (2.30) 



As a result, 

I'(distG,„,,w(i,j) < C2logtj < (2.31) 



and thus, using also J2i=i ^ ^ ^ ^ / (1 ~ b), 



^ fKc2 

E[Bt]<0{l) J^b=0{t''''^')- (2.32) 



It now suffices to note that 



< C2logt) = E[/[distG„^,(,)(Ai, A2) < cslogt]] = ^ ^ = 0(1), (2.33) 

by (I2:32D . for every C2 > such that Kc2 + 1 < 2. □ 

Note that (12.16P is also a lower bound on typical distances in case 6 = 0, which can be proved as 
above. 



3 A log log upper bound on the diameter: Proof of Theorem [1^6] 

The proof of Theorem 11.61 is divided into two key steps. In the first, in Theorem 13.11 we bound the 
diameter of the core which consists of the vertices with degree at least a certain power of logt. This 
argument is close in spirit to the argument in [18] or [43] used to prove bounds on the typical distance for 
the inhomogeneous random graph and the configuration model, respectively, but substantial adaptations 
are necessary to deal with preferential attachment. After this, in Theorem 13.61 we derive a bound on the 
distance between vertices with a small degree and the core. We start by defining and investigating the 
core of the PA-model. In the sequel, it will be convenient to prove Theorem 11.61 for 2t rather than for t. 
Clearly, this does not make any difference for the results. We make use of some technical results, stated 
in the appendix. 

3.1 The diameter of the core 

We recall that r = 3 + 5/m, so that —m < 6 < corresponds to r G (2,3). We take a > 1/(3 — r) = 
—m/5 > 1 and define the core Covet to be 

Core* = {ie [t] : D,{t) > (logt)'^}, (3.1) 

i.e., all the vertices which at time t have degree at least (logt)'^. 
For A CI [t], we write 

diamt(A) = maxdistc (t)(i,j). (3.2) 

t,j€A 

Then, diam2j(Corej) is bounded in the following theorem: 
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Theorem 3.1 (The diameter of the core). Fix m >2 and 6 G {—m, 0). For every a > 1/(3 — r), whp, 

diam2t(Corei) < (1 + o(l))^i^i^. (3.3) 

The proof of Theorem 13. II is divided into several smaher steps. We start by proving that the diameter 
diam2t(Inneri), where 

Innert = {i ^ [t] ■■ Di{t) > ui} , and where ui = t^C^-i) (logt)~^ , (3.4) 

is, whp, bounded. The choice of ui is a technical one: ui is the largest value / so that, whp, the total 
degree of vertices with degree exceeding I can be bounded from below by tP~'^, see Lemma lA.ll In 
Proposition \'d.2\ we will show that the diameter of Inner^ is bounded. After this, we will show that the 
distance from any vertex in the core Corej to the inner core Inner^ can be bounded by a fixed constant 
times log log t. This also shows that diam2t(Coret) is bounded by a different constant times log log t. We 
now give the details. 

Proposition 3.2 (The diameter of the inner core). Fix m>2 and 6 G (— m, 0). Then whp, 

diam2f(Innert) < — — +6. (3.5) 
6 — T 

Proof. We first introduce the important notion of a t-connector between a vertex i G [t] and a set of 
vertices A C [t]. This notion will play a crucial role throughout the proof. We say that the vertex 
j G [2t] \ [t] is a t-connector between i and A if one of the first two edges incident to j connects to i 
and the other of the first two edges incident to j connects to a vertex in A. Thus, when there exists a 
t-connector between i and A, the distance between i and A in Gm,5(2t) is at most 2. 

We continue the analysis by first considering model (c). We note that for a set of vertices A and a 
vertex i with degree at time t equal to Di{t), we have that, conditionally on Gm,&{t)^ the probability that 
j G [2t] \ [t] is a t-connector for i and A is at least 

{D^{t) + 5\A\){Di{t)+5) nPAimit) 

[2t{2m + 5)Y - t^ ' ^ ' 

where in the inequality, we use that Di{t) > m, and we let rj = (m + 5)"^ /{2m{2m + 5))"^ > 0, while, for 
any ^ C [t], we write 

D^{t) = Y,D.,{t). (3.7) 

Note that for fixed j G [2t] \ [t] the lower bound (|3.6p holds independently of the fact whether the other 
vertices are t-connectors or not. 

We now give a coupling proof which shows that a subset of size nt = \ Vt\ of the set Inner^ has, whp, 
a bounded diameter. Lemma lA.ll in the appendix shows that, whp, Inner^ contains at least ^/t vertices. 
Denote the first [-v/tj vertices of Inner^ by /. For each pair ii, ^2 ^ I and each j G [2t] \ [t], the probability 
that j is a t-connector for ^1,^2 is, by (j3.6p . at least 

n _J_ ^--2 

t^ t^ log t log"^ t 

independently of the fact whether the other vertices are t-connectors or not. In the coupling we intend 
to compare the set / and all pairs of vertices of the set /, which are t-connected by some j G [2t] \ [t] 
with a so-called multinomial random graph The graph Hn^ has nt vertices and we identify the 
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= nt(nt — l)/2 ~ t/2 pairs of vertices, which we number from 1 to in an arbitrary order, with ceUs 
of a multinomial experiment with t trials and probabilities given by 



Pk = qt, I <k <et, po = l- etqt- (3.9) 
We can represent the t trials by independent random vectors Ni, N2, ■ ■ ■ , Nt, where 

N, = (AT,, 1, iV,- 2, . . . , iV,-,eJ, l<j<t, (3.10) 

with distribution 

F{Nj = U) = qt, F{Nj = 0) = 1 - etqt, (3.11) 

where Ij is the z*^ unit vector of length ej, and the null vector. If cell k of the multinomial experiment 
is not empty, i.e., if X]j=i -^j,fc > 0, then we draw the edge with number k in the graph Hnt, if the cell 
is empty then this edge is left out. Note that cell is just an overflow cell, which counts the number of 
trials that not resulted in one of the cells 1, 2, . . . , e^. 

By the statement in (j3.8p the distance in Gm,&{'2t) between any two vertices in / is at most two times 
the distance between the corresponding vertices in H^^- In Lemma [A. 2 1 of the appendix we will show that 
the diameter of Hn^ is at most the diameter of a uniform Erdos-Renyi graph G{nt,mt), with nt vertices 
and nit edges, where 

mt = let(l- (l-qtY). (3.12) 



2 

From |35| Section 1.4] we conclude that the above mentioned uniform Erdos-Renyi graph G{nt,mt) 
is asymptotically equivalent with the classical binomial Erdos-Renyi graph G{nt,Xt), where the edge 
probability At is defined by 



1 



1 



A, = 5(l-(l-*)'J~^. (3^13) 

Next, we show that diam(G(nj, A^)) is, whp, bounded by + 1- For this we use the results in 
Corollaries 10.11 and 10.12], which give sharp bounds on the diameter of an Erdos-Renyi random graph. 
Indeed, this results imply that if p^n — 21ogn — )• 00 and n^(l — p) — ?• 00, then diam(G(n,p)) = 2, whp, 
while, for d > 3, if (log n)/d — 3\og logn — t- 00 and p'^n'^~^ — 2 log n — ?• 00, while p'^~^n'^~'^ — 2 log n — t- —00, 
then diam(G(n,p)) = d, whp. In our case, n = nt = Yt^^'^\ and p = Aj, which implies that, whp, 
diam(G(n,p)) = \j^E^ + Ij. We therefore obtain that the diameter of / in Gm^sC^t) is, whp, bounded by 

diam2t(/) < ^^^^ + 2. (3.14) 

O — T 

We finally show that for any i S Inner^ \ /, the probability that there does not exist a t-connector 
connecting i and / is small. Indeed, since Dj{t) > \/tui and Di{t) > ui, the mentioned probability is 
bounded above by 

rjDAtmtlY r rjDAt)lMt)\ / r?nf \ / rjtr^\ 
1 <exp<^ ^<exp<^ 7^^<exp<^ — — V = o(t ), (3.15) 



t2 J ^-^y t j-^^^^^ ^^^...^^ ^^^^ 

for r < 3. Thus, whp, such a vertex i does not exist. This proves that whp the distance between any 
vertex i £ Inner^ \ / and / is bounded by 2, and, together with the above bound on diam2t(/) we thus 
obtain (1331). □ 
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Proposition 3.3 (Distance from the core to the inner core). Fix m > 2 and 6 £ (— ?n, 0). With high 
probability, the inner core Inner^ can be reached from any vertex in the core Coret using no more than 
|k3g(T-^2)| ^'^9es in Gm,5{2t). More precisely, whp, 

max min distg < |,^^°f ^"^l, - (3.16) 

JSCorct jelnncrt \\og[T — I)\ 

Proof. For /c > 1, we define 

M^'^ = {i G [t] : D,{t) > Uk}, (3.17) 

with ui defined in p.4p . and where we define Uk, for k > 2, recursively, so that for any vertex i G [t] 
with degree at least Uk, the probability that there is no t-connector for the vertex i and the set AA'*""^', 
conditionally on Gm,5{t), is tiny. According to (j3.6p and (jA.ip in the appendix, this probability is at most 

(l - ^-E^^^^Ml^' < exp { - ^Bt{u,.,f-u, ^ ^ ^^^^^^ 
for some > 0, when we define 

Uk = D\ogt[uk^iy~ , (3.19) 
with D exceeding 2{rjB)~^ and t is sufficiently large so that < tti. The following lemma identifies Uk'- 
Lemma 3.4 (Identification of u^). For each /c G N, 

Uk = D""'{\ogtf^t'''', (3.20) 

where 

1 - (r - 2f-^ l-{r- 2f-^ 1 (r - 2)^-^ 

a. = , 6. = -(r-2) , = ^. (3.21) 

Proof. We leave the straightforward induction proof to the reader. □ 

Then, the key step in the proof of Proposition 13.31 is the following lemma: 

Lemma 3.5 (Connectivity between A/"^*""^' and AA'*"'). Fix m > 2 and 6 G (— m,0). Then, uniformly in 
k, the probability that there exists an i £ AA**"' that is not at distance at most two from A/'''°"^' in Gm,5{2t) 
is o(t"i). 

Proof. It follows from (j3.18p that the probability in the statement is by Boole's inequality bounded by 

texp ( - vBt[uk^^,r^Uk ^^ ^ ^ ^ ^3_22^ 

□ 



We now complete the proof of Proposition 13. 3[ Fix 

log log t 



k* 



log (r - 2) 



(3.23) 



As a result of Lemma 13.51 we have that the distance between AA** ^ and Inner^ = AA'^' is at most 2k*. 
Therefore, Proposition 13.31 follows when we can show that 

Coret = {i : D,{t) > (logt)"} C AA'^*' = {i : Di{t) > u^*}, (3.24) 
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so that it suffices to prove that (logt)'^ > u^*, for any a > 1/(3 — r). This follows trivially for t large 
from the explicit representation of Uk* given by Lemma l3.4i □ 



Proof of Theorem \3.1[ We note that whp 



diam2t(Coref) < 




+ 6 + 4/c* 



(3.25) 



where k* is given in ()3.23p . and where we have made use of Propositions 13.21 and 13.31 This proves Theorem 



3.2 Connecting the periphery to the core 

In this section, we extend the results of the previous section and, in particular, study the distance between 
the vertices not in the core Coret and the core. The main result is the following theorem: 

Theorem 3.6 (Connecting the periphery to the core). Fix m > 2 and 5 G (— m,0). For every a > 
1/(3 — r), whp, the maximal distance between any vertex and Corej in Gm,&{'^t) is bounded from above 
by 2a log log t / log m . 

Together with Theorem 13.11 Theorem 13.61 proves the main result in Theorem 11.61 

The proof of Theorem 13.61 consists of two key steps. The first key step in Proposition 13.71 states that 
the distance between any vertex in [t] and the core Corej is bounded by a constant times log log The 
second key step in Proposition 13.101 shows that the distance between any vertex in [2t] \ [t] and [t] is 
bounded by another constant times log log 

Proposition 3.7 (Connecting half of the periphery to the core). Fix m > 2 and 5 G {—m, 0). For every 
a > 1/(3 — t), whp, the distance between any vertex in [t] and the core Core^ in Gm,s{'^t) is bounded 
from above by o" log log t/ log m. 

Proof. We start from a vertex i G [t] and will show that the probability that the distance between i and 
Corcf is at least o" log log t/ log m is o{t~^). This proves the claim. For this, we explore the neighborhood 
of i as follows. From i, we connect its m > 2 edges. Then, successively, we connect the m edges from 
each of the at most m vertices that i has connected to and have not yet been explored. We continue in 
the same fashion. We call the arising process when we have explored up to distance k from the initial 
vertex i the k-exploration tree of vertex i. 

When we never connect two edges to the same vertex, then the number of vertices we can reach within 
k steps is precisely equal to m^. We call an event where an edge connects to a vertex which already was 
in the exploration tree a collision. When k increases, the probability of a collision increases. However, 
the probability that there exists a vertex for which more than I collisions occur in its /c-exploration tree, 
where / > 1, before it hits the core is small, as we prove now: 

Lemma 3.8 (A bound on the probability of multiple collisions). Fix m > 2 and 5 G (— m,0). Fix 
Z > 1, 6 G (0,1] and take k < crloglogt/logm. Then, for every vertex i G [t], the probability that its 
k-exploration tree has at least I collisions before it hits Core^ U [t^] is bounded above by 



[3Tl 



□ 




(3.26) 
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tree has degree at most (logt)'^. Hence, for I = 1, the probabihty mentioned in the statement of the 
lemma is at most 

^ iMtHl^ ^ {log tr ^ m^+i(logO" ^ ^3 27) 

where the bound follows from 5 < and G T^''^} < m^^^. For general Z this upper bound becomes: 

When /c = o" log log log m, we have that m^^ = {\ogt)'^K Therefore, the claim in Lemma 13.81 holds. □ 

We next prove that there exists a 6 > such that, whp, [t^] is a subset of the core. Note that in this 
lemma the conditions m > 2 or 5 G {—m, 0) are not necessary. 

Lemma 3.9 (Early vertices have large degrees whp). Fix m > 1. There exists a b > such that, for 
every ex > 1/(3 — r), whp, miUj^^f, Dj{t) > (logt)'^. As a result, whp, [t^] C Corcf. 

We defer the proof of Lemma 13.91 to Section IA.3I of the appendix. Now we are ready to complete the 
proof of Proposition 13. 7t 

Proof of Proposition \37}\ By combining Lemmas 13. 81 and 13. 9| the probability that there exists an i G [t] for 
which the exploration tree TJ^*' has at least / collisions before hitting the core is o(l), whenever Z > 1/6, 
since, by Boole's inequality, it is bounded by 

t 

m'^(logt)2'^yt^' = m'(logt)2'^'r^'+i = o(l). (3.29) 

1=1 

When the fc-exploration tree hits the core, then we are done. When the /c-exploration tree from a vertex i 
does not hit the core, but has less than / collisions, then there are at least m^~^ vertices in /c-exploration 
tree. Indeed, when we have at most / collisions, the size of the ^-exploration tree is minimal when all edges 
of the root connect to the same vertex vi, all edges of t^i connect to the same vertex V2, etc. Iterating 
this at most I levels deep yields a tree with at least m*^~' vertices. 

When k = o" log log t/ log m — 2, m^~'- > (log t)'^'''°^^^ The total degree of the core is, by (lA.ip in the 
appendix, at least 

A(t) >i?^(log^)^("-')^ (3.30) 

iSCorct 

for some B > 0. The probability that there does not exist a t-connector between the /c-exploration tree 
and the core is, by (|3.6p and (|3.30p . bounded above by 



/ 7?i?t(l0gt)'(--2V(l0gt)-+°(l) | 1 

exp <^ } = o{t ), (3.31) 



t 



since cr > 1/(3 — r). This completes the proof. □ 

Proposition 3.10 (Connecting the remaining periphery). Fix m > 2 and 6 G (— m,0). For every 
(J > 1/(3 — r), whp, the maximal distance between any vertex and \t\ in Gm,si2t) is bounded from above 
by uloglogt/logm. 
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Proof. Take k = a log log t / log m — 1 , and j G [2t] \ [t] with distance larger than k to the set of vertices [t] . 
We now apply Lemma 13.81 with t replaced by 2t and letting / = 2 and b = bt ^ (0, 1) such that (2t)^ = t, 
to conclude that with probability exceeding 1 — o(l), the /c-exploration tree of j has at most 1 collision 
before it hits Core2t U [t]. We can hence conclude that with probability exceeding 1 — o(l), there are at 
least mfc = {m — l)'m''~^ vertices in [2t] \ [t] at distance precisely equal to k from our starting vertex j. 
Denote these vertices by ii, . . . , im^.. We consider case (c), the proof for (a) and (b) is similar. Note that, 
uniformly in s G [2t] \ [t], 

E-=i(A(^) + ^) ^1 

{2m + 6)s -2- ^"^-^^^ 

Hence, 

P(^/ e [mfc] such that distG,„ ,(2j)(i;, Coreat U [t]) < l) < 2"™^ = o(t-^), (3.33) 



since = ^^^(logt)°", with a > 1/(3 — r) > 1. Therefore, any vertex j S [2t\ \ \t\ is, whp, within 
distance k + 1 from Coreat U \t\. Proposition IA.3I shows that, whp the set Coreat C \t], so that, whp, 
Core2t U [t] = [t\ and the proposition follows. □ 

Proof of Theorem \3M Proposition 13.101 states that whp every vertex in Gm,5{2t) is within distance 
(T log log t / log m of [t] and Proposition 13.71 states that whp every vertex in [t] is at most distance 
(T log log i / log m from the core Coret. This shows that every vertex in Gm,5{2t) is whp within distance 
2(7 log log 1 1 log m from the core. □ 

Proof of Theorem \1.6i Theorem 13.61 states that every vertex in Gm,s{2t) is within distance ^"^/pg * of 
the core Core*. Theorem 13.11 states that the diameter of the core is at most j^^^^i^Cl + o(l)), so that 
the diameter of Gm,5{2t) is at most log log t, where Cq is given in (II. 7p . because we can choose any 
0" > 1/(3 — r). This completes the proof of Theorem 11.61 □ 



4 A log log i lower bound on the diameter: Proof of Theorem 11.7 

We will again prove this theorem for time 2t rather than time t. To show that the diameter of the graph 
is, whp, at least k, we will study, at time 2t, the fc-exploration trees 7^**^ of vertices i G [2i]\[i] as defined 
above. We shall call the tree T^'*"' proper if the following conditions hold: 

• The fc-exploration tree has no collisions; 

• All vertices of Tj'*"* are in [2t]\[t]; 

• No other vertex connects to a vertex in T^'*"'. 

When such a tree exists in Gm,5{2t) for a certain vertex i then we know that the diameter is at least 
k, since the distance between the root of the tree i and the vertices at depth k is exactly k] there cannot 
be a shorter route. 

To prove that a proper /c-exploration tree exists in Gm,si2t), we will use a second moment method. 
Let T^(2t) be the set of all possible fc-exploration trees that can exist in Gm,5{2t) and satisfy the first 
two conditions. Note that the order in which the edges are added matters: if two edges are added in a 
different order, then the arising exploration tree will be considered a different tree. Let Z^'^(2t) be the 
number of proper A;-exploration trees in Gm,&{2t), i.e., 

^ml5(2*) = ^["^ ^ Grn,s{2t) and r is proper]. (4.1) 
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Here the event that all edges of T have been formed in Gm,s{'^t) is denoted by T ^ Gm,5{'^t). 
In Section [4.11 we will investigate the first moment of Z'^^^{2t) and prove the following: 

Proposition 4.1 (Expected number of proper trees tends to infinity). Fix m > 2 and 6 > —m. Let 

k 



log m 



log log with < e < 1. Then 



lim E 



2t) 



oo. 



(4.2) 



The variance of Z^'^(2t) will be the subject of Section \^.2\ where we will prove the following: 



Proposition 4.2 (Concentration of the number of proper trees). Fix m > 2, 6 > —m and let < k < 

'"ogm* • ^^fi'^ there exists a constant Cm,s > 0, such that, fort sufficiently large, 



Var Z;^y2t) < Cm,s 



(logt)^ 



■E 



+ E 



We use these two propositions to prove Theorem 11.71 
Proof of Theorem We first use the Chebychev inequality to obtain that 



'(diam(G„,5(2f)) < A:) < ¥ {Z'^\{2t) = < 



Var(Z^;y2t) 



E 



By Proposition 14. 2^ the right-hand side of ()4.4p is, for some constant Cm,5 > 0, at most 

1 



{hgtl 



E 



0(1) 



by Proposition 14.11 



(4.3) 



(4.4) 



(4.5) 
□ 



4.1 The first moment of the number of proper trees 

Let Bj- denote the event that no vertex outside a tree T connects to a vertex in this tree. We can then 
write that the expected number of proper fc-exploration trees in Gm,<5(2t) equals 



E 



E ■ 



T Gm,s{'^t) and T is proper 
T is proper] r C G™,5(2t))pfr C Gm,5{2t) 



(4.6) 



We will first give a lower bound on the probability that a given fe-exploration tree exists in the graph 
at time 2t. For convenience we will write am,s = 3(2rra+5) • 

Lemma 4.3 (Lower bound on existence probability). Fix m > 2, 6 > —m and k > 0. Given a proper 
k-exploration tree T G T^(2i), then, for t sufficiently large. 



'(TCGm,sm^ > 



(4.7) 



where m 



(fc) m 



fe+i_ 
m—1 
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Proof. Since every vertex is added before time 2t, the denominator in ()l.ip - (|1.3p is at most 3t{2m + 6). 
The degree of all vertices already in the graph is at least m, so the probability that a certain given edge 
is formed is at least 

^ + ^ = ^ (A R) 

3t{2m + 6) t ' ^ ' 

Since exactly m'*"' — 1 edges have to be formed to form the given tree 7~, we have that 



rcG„,5(2t)j > (^^j . (4.9) 

□ 

We will now give a lower bound on the probability that no other vertex connects to a given tree. 

Lemma 4.4 (No other vertex connects to T). Fix m > 2, 6 > —m and < /c < ^^f^^- Given a proper 
k-exploration tree T G %^(2t\ then, for t sufficiently large, and writing ms = m + l + 5>l, 

/ fc+i \ 

Br\T c G^,s{2t)) >(l- !!^^!^ j . (4.10) 



Remark 4.5. In FyB-j-lT ^ Gm,5{2t)j , Bj- makes a claim about edges not in T, while the event T ^ 

Gm,<5(2t) states that all edges in T are formed in our random graph process. Thus conditioning on 
T ^ Gm,(5(2t) gives information only about inside edges. 

Proof. First note that for k < and t sufficiently large, msm^^^ < msmlogt < t. So < 1 — 

fe + 1 

^^^^ — < 1. Further note that vertices in [t] cannot connect to a vertex in 7~, since T ^ [2t]\[t]. In the 
remainder of the proof we will refer to outside edges as those edges that do not belong to T, of which 
there are exactly mt — (m'*"' — 1) added after time t. For A a set of vertices, let Sni^) denote the event 
that the n-th outside edge added after time t connects to a vertex in A and let £n{A) be the complement 
of £n{A). We use induction on the number of outside edges that did not connect to the tree T, i.e., we 
show that: 



\i=l 



n £iir) r c Gm,sm ) > ( 1 - ^^^^^ ) , (4.11 



by induction on n = 0, . . . , mt — (m''"' — 1). For n = the above holds, because both sides equal 1. Now 
assume that the above holds for < n < mt — {m'-^^ — 1), then 



' n+l 



^£^{T)T<^G,^,^sm\ 
i=i J 

= IP {£n+i{T)\ fj F,(r) n {r c G„,,5(2t)}^ p - ^-'^(2^)) 

> l^l-F{£n+i{T)\^8,{T)r\{T ^G^,s{2t)}^ . (^l _ !!^^!^^ . (4.12) 



Since it is known that at the time that the (n + l)-st outside edge after time t is added, no other outside 
edge has connected to a vertex in the tree, we know that the degree of all vertices in the tree at that 
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moment is at most m + 1. Further, since this edge is added after time t, the denominator of (|l.ip - (|1.3p 
will be at least t. Thus, the right-hand side of ()4.12p is at least 



^ t \ t J - \ t 

fc+l \ n+l 

1 - "^^7 ) , (4.13) 

where the inequality holds because there are less than m^~^^ vertices in the tree. Applying the above to 
n = mt — (m'*'' — 1), we obtain that 

□ 

We finally give a lower bound on the number of possible proper /c-exploration trees that can be formed. 
It should be noted that when a vertex i connects to a vertex j, we will always have that i > j- So when 
exploring a vertex i in the exploration tree, all m vertices this vertex connects to have a smaller label 
than i. 

Lemma 4.6 (Number of proper trees). Fix m > 2 and < k < '°pg'^* • Then, for t sufficiently large, 

(fc) 

the number of possible proper k-exploration trees at time 2t is at least {t/m^^^^ , where we recall that 

m— 1 

Proof. For t sufficiently large and k < '"^g'^S m^~^^ < mlogt < t. Since the fc-exploration tree of a vertex 
i has to be proper, there are no collisions, so the number of vertices in the tree equals 

#{^; G T:'^'} =mW. (4.15) 

For any subset X C [2t]\[t] with ^{v G X} = m'-''^ there exists at least one possible proper fc-exploration 
tree. To see this, first order the vertex labels in descending order. Let the first vertex, i.e. the vertex 
with the largest label, be the root of the tree. Then let the next m vertices be the vertices at distance 
1 from the root, the next m? vertices be the vertices at distance 2 from the root, etcetera, until the last 

vertices which will be at distance k from the root. This way, all vertices will connect to m vertices 
with a smaller label, i.e., vertices that were already in the graph when the vertex was added, so this is a 
possible proper fc-exploration tree with all vertices in X. 

The number of subsets of [2t]\[t] of size m*'"' is {^{k)) which is at least 

where we used that for 1 < 6 < a we have that {a — i)b > (b — i)a for all < i < 6, so that 

^ i=0 

□ 
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We can now combine the three bounds above to get a lower bound on the expected number of proper 
fc-exploration trees. 

Corollary 4.7 (Lower bound on expected number of proper trees). Fix m > 2, 6 > —m and < k < 

" 3g log t 

log m 



log log t y^gj^^ j- sufficiently large, 



E 



Proof. Using the bounds from Lemmas 14.31 14.41 and 14.61 we get that 

>#{r6^S.(2<))fi-™"""'^'"''°"-'^'""'-' 



t J ^ t 



m^'+i J \ t J \ t 

«m,<5 ^mJ^+^J \ t 



□ 



The factor t in the corohary above turns out to be crucial for the remainder of the proof. This factor 
arises from the fact that there is exactly one edge less in a proper /c-exploration tree than there are 
vertices. 

We can now show that the expected number of fe-exploration trees tends to infinity, for k = j^^^ log log t, 
with < e < 1. 

Proof of Proposition First note that for k = j^^^^ log log t, with < e < 1, m'' = (logt)^. We can 
then use Corollarv 14.71 to get that 



lim E 



J. fc+1 / fr-l-l \ 



Z^' (2t) > hm — ^ 1 - ^^^^ = oo, (4.20) 



smce 



jjik+1 J \m(logtY J ' \^ mt 



It is easy to see that the same argument can be applied to k = ''i'^^"^ * — — 1- 



□ 



log m log m 

4.2 The second moment of the number of proper trees 

In this section we will investigate the variance of Z^^^{2t). To shorten the notation, for a A;-exploration 
tree 7~ G T^(2t), let Fj- denote the event that 7~ ^ 0^,5 {2t) and T is proper. Then, the variance of the 
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number of proper A;-exploration trees in Gm,s{'^t) is given by 

Var (^^1^(2*)) = Var ^ I[T C G„,5(2t) and T is proper] 

\rGXj=„(2t) 

= Var[ Yl nFT]]= E Gov (I[Fr], /[Fr^]) 
Vrex*i,(2t) / r,r' e'i*i,(2f) 

= Yl {P{FTnFr')-¥{FT)F{Fr'))+ IP(^r)(l-F(i"r)). (4.22) 

r,r'exj^(2t) rexfe,(2t) 

We start by studying the terms of the first sum in the following lemma. 

Lemma 4.8 (Weak dependence of tree occurrences). Fix m > 2, 6 > —m and < fc < '"pg*^* • Let 
T,T' G Tj^(2t) with T / T' ■ Then, for t sufficiently large, 

P {Ft n Fr>) - F {Ft) P (FrO ^ + ^"'^7^°^^ ) ^ " l j (^r) P (^rO • (4.23) 

Proof. When 7~n T' 7^ 0, at least one edge of one of the trees will connect to a vertex in the other tree, 
so the trees T and T' cannot both be proper. Thus, for T CiT' ^ 9, trivially (|4.23p holds. 

For Tn T' = 0, we have to take a closer look at the probabilities involved. All three probabilities in 
the lemma are a product over all edges of the probability that either the edge does not connect to any 
of the vertices in the tree(s) or the probability that the edge makes a prescribed connection in (one of) 
the tree(s). Let £j^s{A) denote the event that the j-th edge of vertex s connects to a vertex in A, with 
£j,s{i) = £j,s{{i})- Let Sj^s{A) be the complement of £j^s{A). We have that 

p(£:,,,(A)) = Ep(£:,-,(z)), (4.24) 

because the events on the right-hand side are disjunct. These probabilities are given by the growth rules 

(HID-dOD. 

Suppose that the j-th edge, 1 < j < m, of a vertex to should not connect to a vertex in TU T'. Then 
in P {Fj- n F-j-'), there will be a factor 

F{£,,t,{Tur)) = l-F{£,,t,{TuT'))=l- Y ^ (^jaM ■ (4-25) 

ieruT' 

In P (Fr) P {Fr'), there will be a factor 

ll-Y^i£j,toim (l-E^P('^J--to«))- (4.26) 

It is easy to see that 1 — 2; — y < (1 — 2;)(1 — y) for x, y > 0, so (j4.26p is at least as big as (|4.25p . 

When the j-th edge, 1 < j < m, of a vertex to, t + 1 < to < 2t, should connect to a vertex h £ T, 
then in P [Fj- D Fq-i) there will only be a factor 

P(%oW)> (4.27) 
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since it will then automatically not connect to a vertex in T'. In P (F-j-) P {Ff)^ however, there will be a 
factor 



(4.28) 



When we multiply (j4.28p by (l — YlieT' ^ ("^i.toC^))) ^ obtain precisely (|4.27p . By symmetry, the same 
holds when an edge should connect to a vertex in T' . Since the degree of the vertices in the trees is at 
most m + 1, the edges of interest are added after time t and there are less than m^~^^ vertices in the tree, 
we have that 

fi-Erft,.,.(i))V'<(i 



ieT' 



t 



(4.29) 



Since there are less than m^'^^ edges in both T and T', for Tn T' 

F{FrnFr>[ ' n -i / 



P(Fr)P(FrO 



her \ h&T' 



< 1 



msm 



k+l 



t 



-2m' 



k + l 



1 + 



msm' 



keV 

k+l 



t — msm^^^ 



2m' 



k + l 



< 1 + 



msm log t 
t — msm log t 



2m log t 



< 1 + 



2msm log t 
t 



2m log t 



(4.30) 
□ 



We can now use the lemma above to give an upper bound on the variance of Z^'^(2t) in terms of the 
expectation of z!^^^{2t). 

Proof of Proposition Let Cm,5 = 8msm^. Then, using Lemma 14.81 we have that 



Var 



{z'mlsm 



E iF{FrnFr')-F{Fr)FiFr'))+ E IP(^r) (l - F(i"r)) 

reT*;,(2t) 



T,T'&'^{2t) 



< 



E 



r,r'ex^(2t) 



1 + 



2msm log t 
t 



2m log t 



1 P(Fr)P(FrO+ E ^^^r)- (4.31) 



Since 



1 + 



2msm log t 



t 



2m log f 



1 < e~ 



1 < c, 



m,5~ 



(4.32) 



we have that ()4.3ip is at most 



Cm,5 ' 



t 



E P(Fr)P(FrO+E ZW,(2i) 



r,r'exji,{2i) 



< cm/^^ E (^r) IP (Ft') + IE [^::,U2t)' 



T,T'eX^{2t) 



C-m,5 ' 



(logt)^ 



■E 



Z^' (2t) +E Z^^y2t) 



(4.33) 
□ 
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A Appendix 



A.l The tails of the degree sequence 

Lemma A.l (The total degree of high degree vertices). Fix m > 1 and S > —m. Assume that It — s- oo, as 

1 _ 1 

t — )• oo and that /< < tii = t^f^^^' 0-Ogt) 2 . Then there exists a constant B > such that with probability 
exceeding 1 — o{t~^), 

m) > Bti^"- (A.l) 

i:D,{t)>lt 

Moreover, if N>i^(t) = #{i < t : Di{t) > k} is the number of vertices with degree at least h, then, whp, 

N^iM)>Vt. (A.2) 

Proof. We note that 

D.,{t)>ltN^,^{t). (A.3) 

i:D,{t)>lt 

In [23], detailed asymptotics for N>i^{t) were proved for model (c) that we will survey now. These 
asymptotics play a key role throughout the proof. 

Firstly, it is shown that there exists a Bi such that uniformly for all It, 

iV>,^(t) -E[Af>Jt)]| > Biy^thiPj = o(t^i). (A.4) 

This proves a concentration bound on the number of vertices with at least a given degree. The proof of 
this result follows the argument in [T7], and holds for any of the models (a)-(c). 
Secondly, with 

Ni^{t) = #{i<t:D,{t) = lt}, (A.5) 
the total number of vertices of degree equal to It, and with pi^ defined by 

(2 + 6/m)r{lt + 6)r{m + 6 + 2 + 6/m) , ^ , 
T{m + d)r{lt + 1 + + 2 + d/m) 

so that pk ~ with r = 3 + 5/m, there exists a constant B2 such that 

sup\E[Ni,it)]-tpi^\<B2. (A.7) 
i>i 

For model (c), this is shown in [23], for model (a) this is shown in |31l Chapter 8]. This latter proof can 
easily be adapted to deal with model (b) as well. In rather generality, results of this kind (with the sharp 
bound in ()A.7P ) are proved in [30] . 

Therefore, we obtain that, with probability exceeding 1 — o(t~^), 



Ar>Jt) > E[Ar>Jt)] - SiTHogi > E[iV>Jt)] - E[iV>2jt)] - Bi^/tlogt 

2/t-l 

> [iPi - B2] - Bi ^/tlogt > B-itl]-^ -B2lt- Bi y^tlogt, (A.8) 
l=lt 

for some B^ > 0. We now wish to pick It such that tll~'^ is the dominating term in the right-hand side of 
(IA.8p . i.e., k/t^/'' and ^/t logt/tl]"'^ 0, as t 00. Note that i > 2{t~i) r > 2, so for ui as 

in the statement of the lemma and It <ui, we find that (|A.ip holds with probability exceeding 1 — o{t~'^) 
and that, whp, iV>,,(t) > Vi. □ 
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A. 2 The diameter of the multinomial graph 

Lemma A. 2 (Diameter multinomial graph). Let Hn^ he the multinomial graph with parameters defined in 
()3.9p . Then, whp, the diameter of Hm is bounded from above by the diameter of the uniform Erdos-Renyi 
graph G{nt,mt), where the number mt of edges is equal to 

mt = let(l-il-qtY). (A.9) 



2 

Proof. Observe that by definition of the multinomial graph, and with et = nt{nt — l)/2, 

et t 
i=l j=l 

We only have to show that, whp, the random number of edges M„j dominates the deterministic number 
mt- This can be deduced from Chebychev's inequality as follows. 
From a straightforward calculation, 

E[M„J = etil - (1 - QtY) = 2mt, (A.ll) 

and 

Var(M„J = el{{l-2qtY-{l-qt)^')+et{{l-qt)'-{l-2qt)'). (A.12) 
The first term is negative, and the second term can be bounded by ej(l — (1 — qtY), so that 

Var(M„J < et(l - (1 - qtY) = E[M„J = 2mt, (A.13) 
so that the variance is of the same order as the first moment. Applying the Chebychev inequality yields 

P(M„, < mt) < Fi\Mnt - E[M„J| > nit) < Y^^iMjlA ^ q. (A.14) 

mi 



□ 



A. 3 Proof of Lemma 13.91 



We investigate the problem for model (a) first, the adaptation of the proof for model (b) is rather 
straightforward and will be omitted. The proof for model (c) is slightly more involved and is treated 
immediately after the proof for model (a). 

We first note that, for models (a) and (b), the model for general m > 1 is obtained from the model 
for m = 1 by taking 6' = 6/m and identifying groups of m vertices. For m = 1 and 5 > —1, we shall show 
by induction on j, that for model (a) 

\ t\ J J) - Jr(i + A)r(i)' ^ ' 

for all t > i and j ^ 'tt-, with A = (1 + (5)/(2 + (5) G (0, 1) and where Cj will be determined in the course 
of the proof. Clearly, for every t >i, for model (a), 

^^""'^'^ = '^ = jl^ V - {2 + 6)is-l) + {l + 6) ) = WTaWY ^^-''^ 
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which mitiahzes the induction hypothesis with Ci = 1. 

To advance the induction, we let s < t be the last time at which a vertex is added to i. Then we have 
that 

t • _ 1 _L J 

F{Di{t)=j)= IP(A(g - 1) = i - 1) (2 + s)(s - 1) + 1 + = ^'1-^-^") = ■^') • (^-1^) 

s=i+j—l 

By the induction hypothesis, we have that 
Moreover, analogously to ()A.16p . we have that 



t 



J + S 

r(t-i^)r(s + A) 



r(t + A)r(.-i^)- 

Combining ()A.17p . (jA.lSP and ()A.19p . we arrive at 



pfDrn-,^<r V r(.-i)r(z + A) j-i + j r(t - |g)r(. + A) 
r(A(ij - JJ - 2^ _ ^ ^ ^^^^^^ ^2 + - 1) + (1 + 5) r(i + A)r(. - f^) 



j-i + jr(z + A)r(t-^) * r(.-i) 
2 + . rw r(t + A) ^^l._^F(;TIETy (A.20) 



We note that, whenever / + 6, / + l + a>0 and a — 6 + 1 > 0, 

^ r(s + a) _ 1 r r(t + l + a) T{l + a) 1 T{t + l + a) 

^ r(s + 6) ~ a- 6+ U T{t + b) r{l-l + b)\ - a-b+l T{t + b) ' ^ ' ^ 

Application of ()A.2ip for a = —1, 6 = I = i + j — 1, so that a — b + 1 = > when j > 1, leads 

to 

^ ^ ^ -^^ - J ^ j _ 1 r(i) r(t + A) ^ ^ 

Equation ()A.22p advances the induction when we define 

= T{mi + 5y ^^■''^^ 

so that 

For m > 1, inequality (jA.lSP for model (a) generalizes to 

F(DAt) = j) < Q— --— ^7^±^ = C,-— -— — (A.25) 
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This completes the investigation of ¥{Di{t) = j) for model (a). In an identical fashion, for model (b), we 
obtain for m = 1 

F{Diit) = j) < C,^^^— (A.26) 



where again Ci = 1 and Cj satisfies (IA.23p . This generalizes to 



Y(t - ™+'^ )r(i) 

P(A(t) = j) < Cj ''!^^''\' . (A.27) 



We omit further details for model (b). 

For models (a) and (b) we can generalize the inequality for m = 1 to ?n > 1. Unfortunately this fails 
for model (c), and we first adapt the argument. Recall that Di{t) is the degree of vertex i at time t. We 
shall define Ei{t) such that Ei{t) < Di[t) and Ei{t) grows by at most one at each time step. The definition 
of Ei{t) is recursive. We let Ei{i) = Di{i) = m, and, assuming we have shown that Di{t) = Ei{t) + Ri{t), 
where Ri{t) > 0, we proceed at time t + 1 as follows. We can increase Ei{t) only when the first edge of 
vertex t + 1 attaches to vertex i, and we do this with probability -^^^j^- With probability (2m+l)t ' 
keep Ei(t + 1) = Ei(t) and we increase Ri{t) by one. For the other m — 1 edges, we increase Ri{t) by 
one with probability -^[j^- Then we clearly have that Ei(t + 1) < Di{t + 1) if Ei{t) < Di{t), since the 
difference between Di{t) and Ei{t) equals Ri{t), which is monotonically increasing. Moreover, we have 
that Ei{t + 1) equals Ei{t) or Ei{t) + 1, and the latter occurs with conditional probability 

¥{E,{t + 1) = m{t) =j-l)= f~^^,t (A.28) 

(2m + d)t 

We now adapt the above argument for model (a) to the random variable Ei{t). Indeed, we now use as an 
induction hypothesis that 



ntm - 



nm = j) < Cj i]\j"'^'':y, , (A.29) 



where Cm = 1 and, for j > m, 



^ j 1±^^ (A 30) 

J —m 

The verification of ()A.29P is a straightforward adaptation of the one of (jA.lSp . 
We summarize the bounds in models (a)~(c): for all m > 1, and i G [t],j > m, 

T{t + a2)T{i - ai) 

where Ei{t) = Di{t) in models (a) and (b) and where ai = for model (a), while ai = for models 

(b)-(c), while 02 = for model (a), while 02 = for models (b)-(c), and, for all models, Cj < 

for some p > 1. 

Consequently, we obtain 

P A i < j) < L). 4- A.32 

T{t + a2)r(z - ai) 

We finally use (IXMIl to complete the proof of Lemma E31 Take < 6 < = then, by 
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Boole's inequality, 

lb 

p(3z < t' : A(t) < (logt)-) < Y.nD^{t) < (logtr) < aogt)'^^^|^E fl^rT 

i=i ^ i=i ^ 

T{t + a2) r^f-oi) 

This completes the proof of Lemma 13.91 □ 



A. 4 Late vertices have small degree 

Recall the definition of the core Corej in p.ip . where we take cr > 1. In the following theorem we will 
prove that, for models (a)-(c), all vertices with large degree will be early vertices. We need this result to 
prove Theorem II .71 

Proposition A. 3 (Late vertices have small degree). Fix m > 2,5 > —m and a > 1. Then, Core2f ^ [t] 
whp. 

Proof. Note that 

2i 2t 

P(Core2t C [t]) >1- ^{Di{2t) > (log2t)'^) > 1 " 5Z 1^(^(2*) > (log2t)'^) 

i=t+l i=t+l 

= 1 - m[Dt{2t) > (log 2tf'^ , (A.34) 

because vertex t is more likely to have a large degree than vertices added after time t. In Lemma lA.41 we 
will show that P(A(2t) > (log2t)'^) = o (}), so that p(Core2i C [t]j > 1 - o(l). □ 

Lemma A. 4 (Tails of degree distribution). Fix m > 2,6 > —m and a > 1. Then, 

P(^A(2t) > (log 2ty^ = o {1/t) . (A.35) 

Proof. We investigate the problem for models (a) and (b) first, the adaptation for model (c) will be 
discussed later. As noted in Section[Tl for models (a) and (b), Gm,5{2t) can be constructed from Gi^s'{2mt), 
with 6' = 5/m. We will include the superscripts to avoid confusion. Thus identify, for i S [2t], vertices 
((i - l)m + 1)(^), . . . , (im)(^' in Gi^s'i2mt) with vertex i^'") in Gm,5{2t). So (1X35]) is equivalent to 

i^rnt) + ... + (2mt) > (log 2tr^ = a {l/t) . (A.36) 

We will now color the vertices and edges in the following way. Color the vertices 1'^', . . . , ((t — l)m)'^' 
and all edges between these vertices blue and color the vertices ((t — l)m + 1)'^', . . . , (tm)'^' and the m 
edges that are attached to them at time mt red. When a vertex that was added after time mt connects 
to a blue (red) vertex, also color that vertex and its edge blue (red). Color vertices with a self-loop and 
its edge blue. Then, at time 2mt, the total degree of vertices ((t — l)m + 1)'^' , . . . , (tmy^^ is at most equal 
to the number of red edges plus m, because no blue edges are connected to these red vertices, and all red 
edges are connected with at most one endpoint to these vertices. The only exception are the first m red 
edges, which might connect with both endpoints to these vertices, hence we have to add m to the number 
of red edges. Thus, 

P(l)((t„i)^+i)(i) (2mt) + . . . + (2mt) > (log 2t)^) < p(#{red edges} + m > (log 21^^ . (A.37) 
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Since we will bound the right-hand side of the formula above, it is allowed to increase the probability of 
attaching to a red vertex, or, equivalently, to decrease the probability of attaching to a blue vertex. It 
is also allowed to increase the total degree of the red vertices, or to decrease the total degree of the blue 
vertices. All this will only increase the probability of the number of red edges being large. 

Therefore, we are allowed to assume that the first m red edges are all self-loops. Further, we will not 
allow for self- loops after time t, which will increase the probability of attaching to a red vertex in models 
(a) and (b), in model (c) nothing changes. When we consider model (c), we see that the degrees should 
only be updated after each m-th vertex has been added. For j > mt, no more than m edges and vertices 
can be added before updating the degrees, so 



{j + ly^^ connects to a red vertex 



E.(i) red {D,a){m[j/m\) + 6') 
m[j/m\ (2 + 6') 

X^ffi) rod 



^ Z—jv^^i rod 

Thus, we are allowed to update the degrees after adding each vertex, but then we have to lower the total 
weight that blue vertices and edges contribute to the connecting probabilities by m(2 -|- 6'). The above 
bound on the connecting probabilities also holds for models (a) and (b). 

Since we are only interested in the number of red and blue vertices and edges, the problem reduces 
to the following Polya urn scheme. Let there be an urn with, at time s, Si{s) red balls, corresponding 
to the total weight that red vertices and edges contribute to the connecting probabilities, and S2{s) blue 
balls, corresponding to the lowered total weight that blue vertices and edges contribute to the connecting 
probabilities. At time s = we will start with 5i(0) = m{2 + S') and 52(0) = ■m{t - 1){2 + S') - m{2 + 6'). 
We then successively take one ball proportional to the number of balls of a certain color, and replace it 
together with another 2 + 6' balls of the same color. This corresponds to attaching a new vertex to a 
vertex of that color. 



So 



Si (mt) 
2+5' 



has the same distribution as the number of red edges at time 2mt. Consequently, 



A(-) (2i) > (log 2ty 



< 



Si{mt) 
2 + 6' 



+ m> (log2t)'^ 



(A.39) 



To analyse the probability on the right-hand side, we make use of De Finetti's Theorem [28] . This theorem 
states that for an infinite sequence of exchangeable random variables {Xi}^^, Xi G {0, 1}, there exists a 
random variable U with ¥{U G [0, 1]) = 1, such that for all 1 < k < n, 



Xk = 1, -'^fc+i = 0, 



, Xr, 







E 



(A.40) 



The random variable U can be computed explicitly. Note that this implies that 



Y^Xi = k\ = E P('BIN(n, U) = k 



U 



(A.41) 



Let Xi denote the indicator that the i-th ball drawn in the Polya urn scheme described above is red. 
As shown in |3H Section 11.1], {Xi}^^ is an infinite exchangeable sequence. Note that 



5i(s) = i2 + 6')m + {2 + 6')^Xi. 



1=1 



Hence, 



^l^ + m>ilog2tr]=Em) 



(A.42) 



(A.43) 
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where < i/jiu) = F\Bm{mt,u) > (log2t)'^ - 2m j < 1. 
Now observe from [35] that 

^(n) < e-(i°§2tr+2™, (A.44) 
whenever u is such that 7mtu < (log2t)°" — 2m. We define g{t) = ((log2t)°" — 2m)/{7{mt)). Since, 



E 



^{U) = E tP{U) U < g{t) f(u < g{t)^ +E i;{U) U > g{t) f(u > g{tfj 
<^{g{t))+¥(u>g{t)), 



(A.45) 



we obtain, according to ()A.44p . 
' Si{mt) 



46) 



2^^T + ^^ (log2tr^ < e-(i°g2tr+2'"+p(^C/ > g{t)^ + F(u>g{t)y (A. 

It remains to show that also ¥{U > g{t)) = o(j). It turns out that U has a Beta-distribution with 
parameters a = m and /? = m[t—2) (|31|). so a, /3 > 1. Thus we have that the probabihty density function 
of U is unimodular, with its turning point at t = ([35]). It is easy to verify that g{t) > ^"^i^, , for 

t sufficiently large, so that 



m>9{t)) < {i-9{t)) 



([i5]). It is easy to verify that g{t) > ^,+13-2 

^^"^ + {9{t)r~' (1 - 9{t)r' < (1 - 9{t)f . 



r(a)r(/3) 



r(a)r(/3) 



(A.47) 



Using Stirling's formula (see e.g., |T]), one can show that there exists a constant C > 0, such that (|A.47p 
is at most 



C 



/3° 



V{a) 



{l-g{t)f <C{mtr(l 



(log2t)'^ 
8m(t - 2) 



m{t-2) 



because a > 1. 

Note that we in fact proved that F{D^(m)i2t) > (log2t)°') = o{t~'^), for any constant 7. 



(A.48) 
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